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The hexameric MCM complex is the catalytic core of the replicative helicase in eukaryotic 
and archaeal cells. Here we describe the first in vivo analysis of archaeal MCM protein 
structure and function relationships using the genetically tractable haloarchaeon Haloferax 
volcanii as a model system. Hfx. volcanii encodes a single MCM protein that is part 
of the previously identified core group of haloarchaeal MCM proteins. Three structural 
features of the N-terminal domain of the Hfx. volcanii MCM protein were targeted for 
mutagenesis: the p7-p8 and P9-P10 p-hairpin loops and putative zinc binding domain. Five 
strains carrying single point mutations in the p7-p8 p-hairpin loop were constructed, none 
of which displayed impaired cell growth under normal conditions or when treated with 
the DNA damaging agent mitomycin C. However, short sequence deletions within the 
P7-P8 p-hairpin were not tolerated and neither was replacement of the highly conserved 
residue glutamate 187 with alanine. Six strains carrying paired alanine substitutions within 
the P9-P10 p-hairpin loop were constructed, leading to the conclusion that no individual 
amino acid within that hairpin loop is absolutely required for MCM function, although one 
of the mutant strains displays greatly enhanced sensitivity to mitomycin C. Deletions of 
two or four amino acids from the P9-P10 p-hairpin were tolerated but mutants carrying 
larger deletions were inviable. Similarly, it was not possible to construct mutants in 
which any of the conserved zinc binding cysteines was replaced with alanine, underlining 
the likely importance of zinc binding for MCM function. The results of these studies 
demonstrate the feasibility of using Hfx. volcanii as a model system for reverse genetic 
analysis of archaeal MCM protein function and provide important confirmation of the 
in vivo importance of conserved structural features identified by previous bioinformatic, 
biochemical and structural studies. 
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INTRODUCTION 

In all forms of life, successful chromosomal DNA replication 
requires efficient unwinding of the DNA double helix at the 
replication forks, a reaction catalyzed by the replicative DNA 
helicase. In eukaryotes the replicative helicase is the CMG com- 
plex, a tripartite molecular machine composed of Cdc45, MCM 
and GINS (reviewed by Onesti and MacNeill, 2013). The MCM 
(mini-chromosome maintenance) complex is the catalytic core 
of this machine (Vijayraghavan and Schwacha, 2012). MCM is a 
ring-shaped hexamer composed of six related but non-identical 
subunits, each of which is a member of the AAA+ (ATPases 
associated with diverse cellular activities) protein superfamily 
(Duderstadt and Berger, 2008). MCM is loaded onto chromo- 
somal replication origins as a head-to-head double hexamer in 
the Gl phase of the cell cycle in a reaction known as replica- 
tion licensing (Evrin et al., 2009; Remus et al., 2009; Gambus 
et al., 2011). Cdc45 and GINS then assemble at the Gl-S bound- 
ary to form the CMG, activation of which involves MCM subunit 



phosphorylation by DDK (Dbf4-dependent protein kinase) . Once 
activated, individual CMG complexes move with the replication 
forks from origin to inter-origin sequences. 

Consistent with their shared evolutionary history, homologs 
of the major eukaryotic replication factors have been identi- 
fied and characterized in the archaea, including homologs of the 
three components of the CMG. Owing to their high sequence 
similarity to their eukaryotic counterparts, archaeal MCM pro- 
teins were the first to be identified and biochemically character- 
ized. Many archaea encode single MCM proteins that have been 
shown to form — or are presumed to form — homohexameric 
helicase complexes (reviewed by Slaymaker and Chen, 2012). 
The best studied examples of this type of MCM are from the 
euryarchaeon Methanothermobacter thermoautotrophicum and 
the crenarchaeon Sulfolobus solfataricus. However, a number of 
species encode multiple MCM proteins, such as Thermococcus 
kodakarensis and Methanococcus maripaludis, which encode three 
and four, respectively (Walters and Chong, 2010; Ishino et al., 
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2011; Pan et al., 2011). It is not impossible that in some species 
these proteins could form eukaryotic-like heterohexameric com- 
plexes in vivo. Interestingly, only one of the three T. kodakarensis 
MCM proteins is essential for cell viability (Ishino et al., 201 1; Pan 
etal., 2011) 

Structural information is available for a number of archaeal 
MCM proteins and has been used to guide biochemical inves- 
tigations of protein structure-function relationships (reviewed 
by Slaymaker and Chen, 2012). At a structural level, indi- 
vidual archaeal MCM proteins comprise a non-catalytic N- 
terminal domain followed by the catalytic AAA+ domain 
and, at the extreme C-terminus, a short winged helix-turned- 
helix (wHTH) domain. The most extensive crystal structure 
is that of near full-length S. solfataricus MCM which spans 
the N-terminal and catalytic AAA+ domains but not the 
wHTH domain (Brewster et al., 2008). Efforts to determine 
the structure of the wHTH by NMR are ongoing (Wiedemann 
et al., 2013). Additional crystal structures include the N- 
terminal domains of S. solfataricus, M. thermoautotrophicum, 
and Thermoplasma acidophilum MCM proteins, with the lat- 
ter forming a right-handed spiral filament (Fletcher et al., 
2003; Liu et al, 2008; Fu et al., 2014). A left-handed fila- 
ment structure for near full-length S. solfataricus MCM has 
also been determined (Slaymaker et al., 2013), as well as a 
full-length structure of a catalytically inactive MCM homolog 
from Methanopyrus kandleri (Bae et al., 2009). The biologi- 
cal significance, if any, of the filamentous forms remains to be 
determined. 

Unlike the MCM proteins, archaeal GINS and Cdc45 
homologs share only very limited sequence similarity with their 
eukaryotic counterparts (Marinsek et al., 2006). The eukaryotic 
GINS complex is a heterotetramer, comprising the related Sld5, 
Psfl, Psf2, and Psf3 proteins (reviewed by Kamada, 2012). Both 
homotetrameric and heterotetrameric (i.e., dimer of dimer or 
A2B2) complexes have been identified in archaea and the struc- 
ture of the T. kodakarensis A2B2 heterotetrameric GINS has been 
solved (Oyama et al., 201 1). Archaeal Cdc45 homologs have only 
very recently been positively identified as such (Sanchez-Pulido 
and Ponting, 201 1; Krastanova et al, 2012; Makarova et al, 2012). 
These proteins belong to the Reel nuclease branch of the DHH 
hydrolase superfamily. Unlike eukaryotic Cdc45 proteins, at least 
some archaeal Rec)/Cdc45 proteins possess nuclease activity (Li 
et al., 2011; Yuan et al., 2013), the precise function of which is 
unclear. The existence of all three CMG components in archaea 
suggest that these organisms may have a valuable role to play 
as models for dissecting the function of the individual CMG 
components. 

Using multiple sequence alignments and crystal structures 
as a guide, a number of laboratories have reported detailed 
mutagenesis studies of MCM structure -function relationships 
(reviewed by Slaymaker and Chen, 2012). In all cases, the 
effects of the mutations on MCM function were determined 
in vitro, using purified recombinant proteins in various bio- 
chemical assays. To our knowledge, there has been no in vivo 
reverse genetic analysis of the effects of mutations of MCM 
function, largely due to the difficulty or impossibility of con- 
ducting such studies in species where genetic tools are either 



rudimentary or unavailable. In this report we describe the first 
results of reverse genetic analysis of archaeal MCM function 
in vivo, using the haloarchaeal organism Haloferax volcanii as 
a model system. The haloarchaea present a particularly attrac- 
tive model to study archaeal chromosome replication owing to 
the ease with which representative species can be manipulated 
genetically (reviewed by Farkas et al., 2013). Hfx. volcanii in par- 
ticular has proved a highly successful model, with a number of 
components of the Hfx. volcanii replication machinery already 
characterized, including multiple origins of replication (Hawkins 
et al, 2013), origin binding proteins (Norais et al, 2007), single- 
stranded DNA binding proteins (Skowyra and MacNeill, 2012; 
Stroud et al., 2012), the sliding clamp PCNA (Morgunova et al., 
2009; Winter et al, 2009) and both ATP- and NAD-dependent 
DNA ligases (Poidevin and MacNeill, 2006; Zhao et al., 2006). 
In addition to their genetic tractability, over 100 haloarchaeal 
genomes have been now sequenced, offering a wealth of infor- 
mation for comparative protein sequence analysis. Hfx. volcanii 
encodes a single MCM protein, a member of the previously 
defined core group of haloarchaeal MCM proteins discussed fur- 
ther below (MacNeill, 2009). In the work presented here, three 
conserved features of the protein are targeted for mutagenesis: the 
p7-fi8 and fi9-fi 10 P-hairpin loops and the four conserved cys- 
teines of the putative zinc binding domain. The results of these 
studies establish Hfx. volcanii as a valuable model for detailed 
structure-function analysis of MCM helicase and provide confir- 
mation of the importance of these conserved features for MCM 
function in vivo. 

MATERIALS AND METHODS 

DATABASE SEARCHING AND SEQUENCE HANDLING 

Protein sequences were obtained from the UniProt 
Knowledgebase (UniProtKB) database (Magrane and UniProt 
Consortium, 2011): primary accession numbers are listed in 
Table 1. Multiple sequence alignments and phylogenetic trees 
were generated using ClustalX 2.1 (Larkin et al., 2007) and 
njplot (Perriere and Gouy, 1996), respectively. Intein sequences 
were initially identified by visual inspection as large sequence 
insertions in comparative sequence analysis. Inteins boundaries 
were defined by the presence of the N-terminal (block A) and 
C-terminal (block G) intein splicing motifs as defined at the 
InBase intein database (Perler, 2002). 

STRAINS AND GROWTH CONDITIONS 

Hfx. volcanii strains used in this study are listed in Table 2. All 
strains were grown in either Hv-YPC or Hv-CA medium at 
45°C as described in the Halohandbook v7.2 (www.haloarchaea. 
com/resources/halohandbook). For selection procedures, tryp- 
tophan was added to Hv-CA medium at a final concentration 
of 50 u,g/ml. For counter-selection using 5-fluoroorotic acid (5- 
FOA), Hv-CA was supplemented with uracil and 5-FOA at final 
concentrations of 10 |xg/ml and 50 (xg/ml, respectively. For mit- 
omycin C sensitivity assays, wild-type (H26) and mutant cells 
were grown at 45° C in Hv-YPC medium to an ODgsonm of 0.2- 
0.32, before being serially diluted in 18% SW. 5 ul aliquots were 
then spotted on Hv-YPC plates containing 0, 10, 20, or 30ng/ml 
mitomycin C and incubated at 45°C for 5 days. 
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Table 1 | List of MCM proteins analyzed in this study. 



Key to Figure 1 


Species 


UniProtKB accession number 


Length (amino acids) 


Inteins 
A B C D 


1 


Haladaptatus paucihalophilus DX253 


E7QNU9 


698 




2 


Halalkalicoccus jeotgali B3 


D8J3U5 


700 




3A 


Haloarcula marismortui ATCC 43049 


Q5UYX8 


1175 


+ 


3B 




Q5V011 


681 




3C 




Q5V814 


649 




4 


Halobacterium salinarum R 1 


B0R796 


879 


+ 


5A 


Halobiforma lacisalsi AJ5 


M0LR18 


1342 


+ 


5B 




M0LZ47 


312 




6 


Halococcus thailandensis JCM 13552 


M0N8X7 


698 




7 


Haloferax volcanii DS2 


D4GZG5 


702 




8 


Halogeometricum borinquense DSM 11551 


E4NRK9 


1818 


+ + 


9 


Halogranum salarium B-1 


J2ZGA0 


700 




10A 


Halomicrobium mukohataei DSM 12286 


C7P1F9 


873 


+ 


10B 




C7NZ81 


458 




11 A 


Halopiger xanaduensis SH-6 


F8D3Z0 


702 




11 B 




F8DEM3 


698 




11C 




F8DET6 


315 




12 


Haloquadratum walsbyi HBSQ001 


Q18E84 


2216 


+ + + + 


13 


Halorhabdus utahensis AX-2 


C7NUH7 


1412 


+ + 


14A 


Halorubrum lacusprofundi DSM 5036 


B9LTB1 


700 




14B 




B9LUI3 


717 




15 


Halosarcina pallida JCM 14848 


M0D6S0 


1172 


+ 


16A 


Halosimplex carlsbadense 2-9-1 


M0D2C8 


698 




16B 




M0CE93 


712 




17A 


Haloterrigena turkmenica DSM 5511 


D2RUS4 


700 




17B 




D2S3H9 


314 




18 


Halovivax asiaticus JCM 14624 


M0BKPO 


876 


+ 


19A 


Natrialba asiatica DSM 12278 


M0AKM2 


700 




19B 




M0AIF1 


315 




20 


Natrinema pallidum DSM 3751 


L9Z845 


700 




21A 


Natronobacterium gregoryi SP2 


L0AHL1 


1172 


+ 


21 B 




L0AN78 


709 




21C 




L0ANL5 


662 




22 


Natronococcus amylolyticus DSM 10524 


L9XCC3 


1404 


+ + 


23 


Natronolimnobius innermongolicus JCM 12255 


L9X2E8 


700 




24A 


Natronomonas pharaonis DSM 2160 


Q3IML4 


1037 


+ 


24B 




Q3IPB6 


676 




25 


Natronorubrum tibetense GA33 


L9VHE4 


1814 


+ + 


26 


Salinarchaeum sp. Harcht-Bsk1 


R4W989 


697 






Methanosarcina acetivorans C2A 


Q8TSW4 


701 






Methanothermobacter thermautotrophicus AH 


027798 


666 






Sulfolobus solfataricus P2 


Q9UXG1 


686 






Cenarchaeum symbiosum A 


A0RYB8 


697 






Korarchaeum cryptofilum 0PF8 


B1L6L9 


703 






Thermoplasma acidophilum DSM 1728 


Q9HK10 


698 






Archaeoglobus tulgidus DSM 43U4 




586 






Homo sapiens Mcm2 


P49736 


904 






Homo sapiens Mem 3 


P25205 


808 






Homo sapiens Mcm4 


P33991 


863 






Homo sapiens Mem 5 


P33992 


734 






Homo sapiens Mcm6 


Q14566 


821 






Homo sapiens Mcm7 


P33993 


719 




The numbers shown in the first column provide a key for the proteins represented in the phylogenetic tree shown in Figure 1. The plus signs in the final four 
columns indicate the present of intein insertions at each of the four conserved positions A-D shown in Figure 2. Protein 3A, for example, Q5UYX8 from Haloarcula 



marismortui, has a single intein at position A. 
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Table 2 | Haloferax volcanii strains used in this study. 



Strain no. 


Genotype 


Notes 


References 


SMH693 




Wild-type strain DS70 


Wendoloski 
et al., zUUl 


c i\ A i_i r*o/~\ 


ApyrE2 


C"+ I ^ i lor' 

btrain nzb 


ANers et al., 2004 


bMHbbb 


mcm-rM1 ApyrE2 


Single alanine substitution Q186A 


This study 


C A l_l CC A 

bMrlbb4 


mcm-rM3 ApyrE2 


Single alanine substitution E190A 


This study 


c aucco 
bMHbbo 


mcm-rM4 ApyrE2 


Single alanine substitution R193A 


This study 


c i\ A i_i rrn 

bMHbbU 


mcm-rM5 ApyrE2 


Single alanine substitution E196A 


This study 


o fv yi i i ceo 

blVIHbbz 


mcm-rM6 ApyrE2 


Single alanine substitution Q199A 


This study 


C\/IUCOO 

blvlnboo 


mcm-bH1 ApyrE2 


raireu alanine supstitution nZzoA/lzz/A 


This study 


olVlnb4U 


mcm-unz Apyrtz 


raireu alanine suustitution tzzoA/uzzyA 


This study 


SMH642 


mcm-bH3 ApyrE2 


Paired alanine substitution Q230A/T231A 


This study 


SMH644 


mcm-bH4 ApyrE2 


Paired alanine substitution S232A/G233A 


This study 


SMH646 


mcm-bH5 ApyrE2 


Paired alanine substitution N234A/E235A 


This study 


SMH648 


mcm-bH6 ApyrE2 


Paired alanine substitution K236A/T237A 


This study 


SMH649 


mcm-hD1 ApyrE2 


Two amino acid deletion T231/S232 


This study 


SMH650 


mcm-hD2 ApyrE2 


Four amino acid deletion Q230/T231/S232/G233 


This study 


SMH652 


mcm-S1 ApyrE2 


Silent Acc65\ restriction site replacing codons 141 and 142 


This study 



MOLECULAR CLONING REAGENTS 

Enzymes for molecular cloning were purchased from New 
England Biolabs (NEB), Promega or Fermentas. Oligonucleotides 
were synthesized by DNA Technology A/S (Risskov, Denmark). 
DNA sequencing was performed by Eurofms MWG Operon 
(Ebersberg, Germany). DNA purification kits were from Qiagen. 
PCR amplification was performed using the GC-rich PCR system 
(Roche) with Taq polymerase (NEB) substituting for the GC-rich 
enzyme as necessary. For routine cloning purposes, E.coli DH5a 
{fhuA2 A(argF-lacZ)U169phoAglnV44 OS0 A(lacZ)M15 gyrA96 
recAl relAl endAl thi-1 hsdR17) was used (Invitrogen). To pre- 
pare unmethylated plasmid DNA for Hfx. volcanii transforma- 
tion, E.coli GM121 (F- dam-3 dcm-6 ara-14fhuA31 galK2 galT22 
hdsR3 lacYl leu-6 thi-1 thr-1 tsx-78) was used. 

CONSTRUCTION OF MUTANT Hfx. Volcanii STRAINS 

Mutant strains were constructed using the pop-in/pop-out 
method (Bitan-Banin et al., 2003) in Hfx. volcanii ApyrE2 strain 
H26 as follows. First, plasmid pTA131-HfxMCM-HXba was con- 
structed by using oligonucleotide primers HfxMCM-5H and 
HfxMCM-3X (designed to include Hindlll and Xbal sites respec- 
tively, see Table 3) to amplify a 1.0 kb fragment of Hfx. vol- 
canii genomic DNA spanning the region from 100 nucleotides 
upstream of the mem ORF to 900 nucleotides inside the ORE 
The PCR product was digested with Hindlll and Xbal, cloned into 
plasmid pTA13 1 digested with the same two enzymes (Allers et al., 
2004) and sequenced to confirm the absence of PCR errors. 

The resulting plasmid (pTA131-HfxMCM-HXba) was then 
used as a template for PCR overlap extension mutagenesis (OEM) 
to create 25 mutant derivatives. The sequences of the muta- 
genic primers are given in Table 3. OEM was performed using 
the GC-Rich PCR system buffer and GC-Rich enzyme mix 
(Roche) for the first round of PCR and the GC-Rich PCR sys- 
tem buffer and Taq polymerase (NEB) for the second round. 
Oligonucleotides HfxMCM-5H and HfxMCM-3X were used as 



the flanking primers throughout. The resulting PCR products 
were then restricted with Bsu36l and BsfEII (for fi7-fi8 ^-hairpin 
and zinc binding domain mutants) or BstEll and Xbal (for f59- 
F310 P-hairpin mutants) and the 329 bp Ssi(36I-SsfEII or 271 bp 
BstEll-Xbal pieces carrying the mutations cloned back into 
pTA131-HfxMCM-HXba from which the corresponding wild- 
type restriction fragments had been removed. Plasmids were 
again sequenced to confirm the absence of unwanted sequence 
changes before being passaged through E.coli GM121 to generate 
unmethylated DNA for transformation into Hfx. volcanii ApyrE2 
strain H26 (Table 2). 

Transformation of Hfx.volcanii was accomplished as described 
in the Halohandbook v7.2 (www.haloarchaea.com/resources/ 
halohandbook). Transformants obtained on Hv-CA medium 
lacking uracil were grown for 30 generations at 45° C in non- 
selective Hv-YPC medium to allow loss of the plasmid before 
being plated on Hv-CA plates containing 50 (xg/ml 5-FOA and 
a 10 u,g/ml uracil. Colonies formed on these plates were then 
inoculated into 500 [jlI of Hv-YPC liquid medium and grown 
overnight at 45°C. Genomic DNA from these was prepared 
by taking 10 (xl of overnight culture, adding to 500 ul of ster- 
ile water and heating to 70° C for lOmin to lyse the cells. 
For the P-hairpin and zinc binding domain mutants, 1 of 
the resulting mix was used in PCR reactions to screen for 
the presence of the mutants in the chromosome. To discrimi- 
nate between wild-type and mutant sequences, oligonucleotide 
primers with mismatched 3' sequences were used in PCR reac- 
tions in conjunction with primer HFXMCM-R1150, which lies 
1150 nucleotides into the mem ORF and which is therefore not 
present in pTA131-HfxMCM-HXba (see Table 3 for oligonu- 
cleotide sequences). For the silent restriction site mutant, the 
partial mem ORF was amplified using primers HfxMCM-5H 
and HfxMCM-3X and the PCR products digested with Acc65I 
to identify mutants. In all cases, putative positive clones were 
then re-streaked twice to single colonies on Hv-YPC agar at 
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Table 3 | Oligonucleotides used in this study. 



Table 3 | Continued 



Primer name Sequence 

A. FOR CONSTRUCTION OF pTA131-HfxMCM-HXba 

HfxMCM-5H 5'-GTGTGTGTGT AAGCTT CCTCCGCGAGGCGACGGA-3' 
HfxMCM-3X 5'-GGTGGTGGTGTCJFAGACATGGCAATCTTCTCCTG-3' 

B. FOR CONSTRUCTION OF 07-08 0-HAIRPIN MUTANTS BY OEM 

5'-AAACTGCGCGTCGCCGAGTCCCCCGAGGGC-3' 
5'-GCCCTCGGGGGACTCGGCGACGCGCAGTTT-3' 
5'-AAACTGCGCGTCCAGGCCTCCCCCGAGGGCCTGCG-3' 
5'-CGCAGGCCCTCGGGGGAGGCCTGGACGCGCAGTTT-3' 
5'-CAGGAGTCCCCCGCCGGCCTGCGCGGGGGC-3' 
5'-GCCCCCGCGCAGGCCGGCGGGGGACTCCTG-3' 
5'-CCCGAGGGCCTGGCGGGGGGCGAGACGCCG-3' 
5'-CGGCGTCTCGCCCCCCGCCAGGCCCTCGGG-3' 
5'-CTGCGCGGGGGCGCCACGCCGCAGAGCATC-3' 
5'-GATGCTCTGCGGCGTGGCGCCCCCGCGCAG-3' 
5'-GGCGAGACGCCGGCCAGCATCGACATCAAC-3' 
5'-GTTGATGTCGATGCTG GCCG GCGTCTCG CC-3' 
5'-CAGGAGTCCCCCGAG-GGCGAGACGCCGCAG-3' 
5'-CTGCGGCGTCTCGCC-CTCGGGGGACTCCTG-3' 
5'-GTCCAGGAGTCCCCC-GAGACGCCGCAGAGC-3' 
5'-GCTCTGCGGCGTCTC-GGGGGACTCCTGGAC-3' 
5'-CGCGTCCAGGAGTCC-ACGCCGCAGAGCATC-3' 
5'-GATGCTCTGCGGCGT-GGACTCCTGGACGCG-3' 
5'-CTGCGCGTCCAGGAG-CCGCAGAGCATCGAC-3' 
5'-GTCGATGCTCTGCGG-CTCCTGGACGCGCAG-3' 

C. FOR CONSTRUCTION OF 09-010 0-HAIRPIN MUTANTS BY OEM 



Primer name 



Sequence 



HFXMCM- 


-rM1-F 


HFXMCM- 


-rM1-R 


HFXMCM- 


-rM2-F 


HFXMCM- 


-rM2-R 


HFXMCM- 


-rM3-F 


HFXMCM- 


-rM3-R 


HFXMCM- 


-rM4-F 


HFXMCM 


-rM4-R 


HFXMCM 


-rM5-F 


HFXMCM 


-rM5-R 


HFXMCM 


-rM6-F 


HFXMCM 


-rM6-R 


HFXMCM 


-rD1-F 


HFXMCM 


-rD1-R 


HFXMCM 


-rD2-F 


HFXMCM 


-rD2-R 


HFXMCM 


-rD3-F 


HFXMCM 


-rD3-R 


HFXMCM 


-rD4-F 


HFXMCM 


-rD4-R 



HFXMCM-bH1-F 
HFXMCM-bH1-R 
HFXMCM-bH2-F 
HFXMCM-bH2-R 
HFXMCM-bH3-F 
HFXMCM-bH3-R 
HFXMCM-bH4-F 
HFXMCM-bH4-R 
HFXMCM-bH5-F 
HFXMCM-bH5-R 
HFXMCM-bH6-F 
HFXMCM-bH6-R 
HFXMCM-hD1-F 
HFXMCM-hD1-R 
HFXMCM-hD2-F 
HFXMCM-hD2-R 
HFXMCM-hD3-F 
HFXMCM-hD3-R 
HFXMCM-hD4-F 
HFXMCM-hD4-R 



5'-GTCGGCGTCCTC GCAGCG GAACAGCAGACATCG-3' 

5'-CGATGTCTGCTGTTCCGCTGCGAGGACGCCGAC-3' 

5'-GTCCTCCACATCGCAGCGCAGACATCGGGCAAC-3' 

5'-GTTGCCCGATGTCTGCGCTGCGATGTGGAGGAC-3' 

5'-CACATCGAACAGGCAGCGTCGGGCAACGAGAAG-3' 

5'-CTTCTCGTTGCCCGACGCTGCCTGTTCGATGTG-3' 

5'-GAACAGCAGACAGCAGCGAACGAGAAGACGCCC-3' 

5'-GGGCGTCTTCTCGTTCGCTGCTGTCTGCTGTTC-3' 

5'-CAGACATCGGGC GCAGCG AAGACGCCCGTCTTC-3' 

5'-GAAGACGGGCGTCTTCGCTGCGCCCGATGTCTG-3' 

5'-TCGGGCAACGAGGCAGCGCCCGTCTTCGACTAC-3' 



5'-GTAGTCGAAGACGGGCGCTGCCTCGTTGCCCGA-3' 
5'-CACATCGAACAGCAG-GGCAACGAGAAGACG-3' 
5'-CGTCTTCTCGTTGCC-CTGCTGTTCGATGTG-3' 
5'-CTCCACATCGAACAG-AACGAGAAGACGCCC-3' 
5'-GGGCGTCTTCTCGTT-CTGTTCGATGTGGAG-3' 
5'-GTCCTCCACATCGAA-GAGAAGACGCCCGTC-3' 
5'-GACGGGCGTCTTCTC-TTCGATGTGGAGGAC-3' 
5'-GGCGTCCTCCACATC-AAGACGCCCGTCTTC-3' 
5'-GAAGACGGGCGTCTT-GATGTGGAGGACGCC-3' 
D. FOR CONSTRUCTION OF ZINC BINDING DOMAIN MUTANTS BY OEM 



HFXMCM-C137A-F 
HFXMCM-C137A-R 
HFXMCM-C140A-F 
HFXMCM-C140A-R 
HFXMCM-C159A-F 
HFXMCM-C159A-R 
HFXMCM-C162A-F 
HFXMCM-C162A-R 



5'-CCGCCTTCGAGGCGCAGCGCTGCGG-3' 
5'-CCGCAGCGCTGCGCCTCGAAGGCGG-3' 
5'-AGTGCCAGCGCGCGGGGACGATGAG-3' 
5'-CTCATCGTCCCCGCGCGCTGGCACT-3' 
5'-AACCCCACGAGGCGCAGGGATGCGA-3'5' 
TCGCATCCCTGCGCCTCGTGGGGTT-3' 
5'-AGTGTCAGGGAGCGGAGCGCCAGGG-3' 
5'-CCCTGGCGCTCCGCTCCCTGACACT-3' 
E. FOR CONSTRUCTION OF SILENT Acc65l mutation by OEM 
HFXMCM-S1-F 5'-TG CCAG CG CTG CGGTACC ATG AG CTACATC-3' 

HFXMCM-S1-R 5'-GATGTAGCTCATGGTACCGCAGCGCTGGCA-3' 



F. FOR DETECTION OF 07-08 0-HAIRPIN MUTANTS 



i i r~\/ ft a o ft a 


tMI-WT 


5 -CAGAAACTGCGCGTCCAG-3 


Li r-vi\ /I / — ■ i\ ji 
HFXMCM- 


-rMl-MU 1 


o -CAGAAAC I GCGCG I CGCC-3 


HFXMCM- 


- l\ A O V A /~r 

-rM2-WT 


5 -AAACTGCGCGTCCAGGAG-3 


l 1 1 — v/ rv >l ^ rv >l 

HFXMCM- 


-l\ A O Ty A\ IT 

-rM2-MUT 


5 -AAACTCCGCCTCCAGGCC-3 


HFXMCM- 


TM J-VV 1 


5 -G I CCAGGAG I CCCCCGAG-3 


II rvN /l/Ti /I 

HFXMCM- 


.-Pi Ad Pi A 1 IT 

TMJ-MU 1 


0 -G I CCAGGAG I CCCCCGCC-3 


rlrAIVIL.IvT 


rKAA \A/T 

-IIVI4-VV 1 


O - 1 L^L.L.L.L,LiML]L:L]L,L, I LjL.LdL.-o 


HFXMCM 


-rM4-MUT 


5'-TCCCCCGAGGGCCTGGCG-3' 


HFXMCM 


-rM5-WT 


5'-GGCCTGCGCGGGGGCGAG-3' 


HFXMCM 


-I-M5-MUT 


5'-GGCCTGCGCGGGGGCGCC-3' 


HFXMCM 


-rM6-WT 


5'-GGGGGCGAGACGCCGCAG-3' 


HFXMCM- 


-rM6-MUT 


5'-GGGGGCGAGACGCCGGCC-3' 


HFXMCM 


-rD1-MUT 


5'-GGAGTCCCCCGAGGGCGAG-3' 


HFXMCM 


-rD2-MUT 


5'-CCAG G AGTCCCCCGAGAC-3' 


HFXMCM 


-I-D3-MUT 


5'-CGTCCAGGAGTCCACGCC-3' 


HFXMCM- 


-rD4-MUT 


5'-CTGCGCGTCCAGGAGCCG-3' 


G. FOR DETECTION OF 09-010 0-HAIRPIN MUTANTS 


HFXMCM- 


■bH1-WT 


5'-GTCGGCGTCCTCCACATC-3' 


HFXMCM 


-bH1-MUT 


5'-GTCGGCGTCCTCGCAGCG-3' 


HFXMCM 


-bH2-WT 


5'-GTCCTCCACATCGAACAG-3' 


HFXMCM 


-bH2-MUT 


5'-GTCCTCCACATCGCAGCG-3' 


HFXMCM 


-bH3-WT 


5'-CACATCGAACAGCAGACA-3' 


HFXMCM 


-bH3-MUT 


5'-CACATCGAACAGGCAGCG-3' 


HFXMCM 


-bH4-WT 


5'-GAACAGCAGACATCGGGC-3' 


HFXMCM 


-bH4-MUT 


5'-GAACAGCAGACAGCAGCG-3' 


HFXMCM 


-bH5-WT 


5'-CAGACATCGGGCAACGAG-3' 


HFXMCM 


-bH5-MUT 


5'-CAGACATCGGGCGCAGCG-3' 


HFXMCM 


-bH6-WT 


5'-ACATCGGGCAACGAGAAGA-3' 


HFXMCM 


-bH6-MUT 


5'-ACATCGGGCAACGAGGCAG-3' 


HFXMCM 


-hD1-WT 


5'-CACATCGAACAGCAGACA-3' 


HFXMCM 


-hD1-MUT 


5'-C AC ATCG AAC AG CAG G G C-3' 


HFXMCM 


-hD2-WT 


5'-CTCCACATCGAACAGCAG-3' 


HFXMCM 


-hD2-MUT 


5'-CTCCACATCGAACAGAAC-3' 


HFXMCM- 


-hD3-WT 


5'-TCCTCCACATCGAACAGC-3' 


HFXMCM 


-hD3-MUT 


5'-TCCTCCACATCGAAGAGA-3' 


HFXMCM 


-hD4-WT 


5'-GGCGTCCTCCACATCGAA-3' 


HFXMCM- 


-hD4-MUT 


5'-GGCGTCCTCCACATCAAG-3' 



H. FOR DETECTION OF ZINC BINDING DOMAIN MUTANTS 



HFXMCM-C137A-WT 


5' 


-GAAGCCGCCTTCGAGTGC-3' 


HFXMCM-C137A-MUT 


5' 


-GAAGCCGCCTTCGAGGCG-3' 


HFXMCM-C140A-WT 


5' 


-TTCGAGTGCCAGCGCTGC-3' 


HFXMCM-C140A-MUT 


5' 


-TTCGAGTGCCAGCGCGCG-3' 


HFXMCM-C159A-WT 


5' 


-CAGGAACCCCACGAGTGT-3' 


HFXMCM-C159A-MUT 


5' 


-CAGGAACCCCACGAGGCG-3' 


HFXMCM-C162A-VVT 


5' 


-CACGAGTGTCAGGGATGC-3' 


HFXMCM-C162A-MUT 


5' 


-CACGAGTGTCAGGGAGCG-3' 



I. REVERSE PRIMER FOR MUTANT DETECTION 

H FXMCM-R1 1 50 5'-GCGAATCCGCGAGCCGTC-3' 



Restriction sites in oligonucleotide primers used for pTA131-HfxMCM-HXba con- 
struction underlined. Oligonucleotide primers for overlap extension mutagenesis 
(OEM) are shown in top strand-bottom strand pairs with the mutated bases 
underlined in the top strand primer (for amino acid substitutions) or with deletion 
boundaries indicated with a dash. Oligonucleotide primers for mutant detection 
(Continued) are shown with 3' mismatched bases underlined. 
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FIGURE 1 | Phylogenetic tree of representative haloarchaeal MCM 
proteins. The sequences of 39 proteins from 26 haloarchaeal genera were 
compared using ClustalX 2.1 (Larkin et al., 2007) and an unrooted 
phylogenetic tree generated using njplot (Perriere and Gouy, 1996). 
Twenty-six proteins (one from each included species) form a core group 
with the remaining 13 proteins designated as outliers. Individual proteins 
are numbered according to Table 1. Accession numbers can also be found 
in Table 1. 



45°C before being re-tested by PCR using wild-type and mutant 
primers. Finally, the presence of the mutation was confirmed by 
sequencing of the relevant part of the mem ORF amplified by PCR 
from genomic DNA (see Supplementary Information for DNA 
sequence traces). 

RESULTS 

HALOARCHAEAL MCM PROTEINS 

Prior to embarking on reverse genetic analysis of Hfx. volcanii 
MCM protein function, we undertook a detailed bioinformatic 
analysis of MCM protein distribution and conservation across the 
haloarchaea. To identify haloarchaeal MCM proteins, we searched 
the UniProtKB sequence database using BLAST with the Hfx. 
volcanii MCM protein (HVO_0220, UniProtKB accession num- 
ber D4GZG5) as the query sequence. Almost 200 proteins were 
identified in ~ 120 species belonging to 26 different genera of the 
Halobacteriaceae. To simplify further analysis, we selected a single 
species as a representative of each genus. These 26 species encoded 
a total of 39 MCM proteins. A complete list of the species under 
investigation, together with accession numbers for the proteins 
we identified, can be found in Table 1. 

To investigate the relationship between the proteins in greater 
detail, we constructed multiple protein sequence alignments 
using ClustalX 2.1 (Larkin et al., 2007). Previously, when 
analysing a significantly smaller number of genomes (five), we 
identified a core group of five MCM proteins and a further three 
proteins that we designated as outliers (MacNeill, 2009). We per- 
formed a similar analysis of the larger dataset, using ClustalX 
to generate multiple sequence alignments of protein sequences 
(from which inteins were first removed — see below) and njplot 
(Perriere and Gouy, 1996) to generate unrooted phylogenetic 
trees. Figure 1 shows the phylogenetic tree for the 39 proteins. 
As before, two groups are apparent, corresponding to the core 
and outlier groups defined previously (MacNeill, 2009). Each of 
the 26 species encodes a single member of the core group that 
includes the Hfx. volcanii MCM protein. The core group MCM 
proteins range in length from 697 to 702 amino acids (after inteins 
are removed — see below) and display a minimum pairwise pro- 
tein sequence identity of 68% (range 68-95%). Three hundred 
and twenty four residues (46% of the protein sequence) are abso- 
lutely conserved across all 26 core group proteins. This includes 
the key catalytic residues that make up the Walker A (P-loop) 
motif (GDPGTGKS in all 26 proteins compared to the consen- 
sus GXjGKS/T), the Walker B motif (DELD in all 26 proteins) 
and arginine finger motif (SRF in all 26 proteins). 

The outlier group proteins display much greater variety in 
length (312-717 amino acids) and sequence similarity (24-48% 
identity with Hfx. volcanii MCM). The four shortest proteins 
(312-315 amino acids) are made up of sequences related to 
the non-catalytic N-terminal domain of MCM only and cannot 
possess helicase activity. The remaining nine outliers all possess 
conserved Walker B (DEL/ID) and arginine finger (SRF) motifs, 
as well as the key lysine in the Walker A (P-loop) motif, sug- 
gesting that these proteins may well have ATPase and/or helicase 
activities. To date, none has been characterized biochemically. 

Inteins are parasitic genetic elements capable of efficient self- 
splicing at the protein level (Gogarten and Hilario, 2006) and are 



a common feature of the haloarchaeal MCM proteins. Amongst 
the 39 proteins, inteins were found in 13, all members of the 
core group defined above, and at four different locations within 
the C-terminal catalytic domain of the protein, with the largest 
number of inteins in an individual MCM protein being four 
(Table 1). Figure 2 shows the position of the four inteins relative 
to conserved sequence features found in archaeal MCM proteins. 
As noted previously, inteins are frequently located at or very 
near to highly conserved and functionally important sequence 
regions (Gogarten and Hilario, 2006) and the haloarchaeal MCM 
inteins are no exception: intein insertion site A is located imme- 
diately C-terminal to the essential lysine in the Walker A sequence 
GDPGTGKS mentioned above, while intein C lies just four 
amino acids C-terminal to the Walker B sequence DELD (Table 1, 
Figure 2). Intein splicing is therefore likely to be essential for 
protein function. 

In a number of archaeal organisms, including representatives 
of the crenarchaea, euryarchaea, thaumarchaea and korarchaea, 
the gene encoding MCM is found adjacent to that encoding a 
GINS subunit (MacNeill, 2010). To ask whether this arrangement 
is also found in any of the species under investigation in this 
report, we examined the chromosomal context of each of the 26 
core MCM genes and also the 13 outlier proteins. None of the 
genes encoding core group proteins is located adjacent to a gene 
encoding GINS or indeed, to any known replication gene (data 
not shown). A similar situation is seen with genes encoding 12 
of the 13 outlier proteins, the sole exception being the pNG3053 
protein (UniProtKB accession number Q5V814, labeled 3C in 
Figure 1) encoded by plasmid pNG300 of Haloarcula maris- 
mortui which is located immediately 3' to an ORF encoding 
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FIGURE 2 | The Haloferax volcanii MCM protein. Schematic 
representation showing domain organization of the Hfx. volcanii MCM 
protein, highlighting the location of key structural motifs. The locations of 
intein insertions (from right to left, inteins A-D, see Table 1) in other 
haloarchaeal MCM proteins (not in Hfx. volcanii MCM, which is intein-free) 
are indicated by the open circles at positions 335, 362, 401 , and 41 5. 



the C-terminally truncated GINS homolog pNG3052 (data not 
shown). 

REVERSE GENETIC ANALYSIS OF MCM FUNCTION 

Hfx. volcanii encodes a single intein-free MCM protein of 702 
amino acids in length (Table 1) that comprises an N-terminal 
domain that spans residues 1-283, an AAA+ catalytic core 
domain spanning residues 283-632 and a C-terminal winged 
helix-turn-helix (wHTH) domain spanning residues 633-702. All 
the conserved sequence features characteristic of archaeal MCM 
proteins are found in the Hfx. volcanii protein (Figure 2). In order 
to test whether Hfx. volcanii was a workable model for reverse 
genetic analysis of archaeal MCM function, three of these con- 
served features were targeted for mutagenesis: the p 1 7-f$8 p 1 -hairpin 
loop (also known as the allosteric communication loop, ACL), 
the P9-{510 f5-hairpin loop and the putative zinc binding domain 
(Figure 2). 

To introduce mutations into the mem gene, we used the pop- 
in/pop-out method (Bitan-Banin et al., 2003). To achieve this, 
a plasmid was constructed carrying a 1.0 kb region of the Hfx. 
volcanii genome spanning the first 900 nucleotides of the mem 
open reading frame together with 100 nucleotides of 5' flank- 
ing region. The plasmid also carries the pyrE2 selectable marker 
{pyrE2 function is required for uracil prototrophy) but does not 
possess a replication origin capable of promoting autonomous 
replication in Hfx. volcanii. Stable maintenance therefore requires 
that the plasmid integrates (pops-in) into the Hfx. volcanii 
genome by means of homologous recombination between the 
mem sequences on the plasmid and the mem gene in the chro- 
mosome. PCR overlap extension mutagenesis was used to gen- 
erate a series of mutated forms of the plasmid in which the 
targeted amino acids were either replaced, singly or in pairs, 
with one or two alanine residues respectively, or deleted alto- 
gether. Next, the plasmids were introduced into Hfx. volcanii 
ApyrE2 strain H26 by standard methods and transformant (pop- 
in) colonies obtained on Hv-CA plates lacking uracil. Multiple 
independent colonies were then individually picked and grown 
in non-selective (uracil-containing) Hv-YPC medium before 



being plated onto Hv-CA plates containing 5-fluoroorotic acid 
to select for (pop-out) clones that had lost the plasmid (see 
Materials and Methods). Pop-out colonies were then screened by 
PCR using oligonucleotide primers specific for either the wild- 
type or mutant sequences (see Table 3 for primer sequences). 
Candidate mutant strains were sequentially re-streaked twice 
on non-selective medium (Hv-YPC) before the presence of the 
mutation was confirmed by PCR using wild-type- and mutant- 
specific primers and by sequencing of amplified genomic DNA 
(see Materials and Methods). Table 2 lists the strains constructed 
by this method. 

MUTAGENESIS OF THE p7-|58 p-HAIRPIN LOOP 

First identified on the basis of its high degree of sequence con- 
servation across species, the pV-p"8 ^-hairpin loop (also known as 
the allosteric communication loop, ACL) is located at the inter- 
face between the N-terminal and AAA+ catalytic domains of the 
MCM protein (Figure 3). A number of mutants in p7-p"8 loop 
have previously been analyzed biochemically, leading to the con- 
clusion that the loop plays a role in coupling the activities of 
these two domains (Sakakibara et al., 2008; Barry et al., 2009). 
The mutations tested include six single point mutants in recombi- 
nant M. thermoautotrophicum MCM (including four at the boxed 
conserved residues shown in Figure 3B) as well as a triple point 
mutant and a complete replacement of the loop by the tripep- 
tide serine-asparagine-glycine in S. solfataricus MCM (Sakakibara 
et al, 2008; Barry et al, 2009). 

In order to test whether the function of the pV-p"8 loop 
function was essential in vivo in Hfx. volcanii, we attempted to 
construct 10 different alleles (Figure 3): six single point mutants 
in which individual charged or bulky polar amino acids within 
the loop were replaced with alanine (mutants mcm-rMl — mcm- 
rM6) and four deletions, of two, four, six, and eight amino acids 
(mutants mcm-rDl — mcm-rD4, respectively). Two of the individ- 
ual amino acids targeted (Q186 and E187) are conserved across 
species (Figure 3B); the equivalent residues were mutated in M. 
thermoautotrophicum MCM (Sakakibara et al., 2008). 

Of the 10 mutants, five (mcm-rMl, mcm-rM3 — mcm-rM6) 
were readily identified by PCR screening using genomic DNA 
templates and oligonucleotide primers specific for the wild- 
type and mutant sequences (see Table 3) and confirmed by 
DNA sequencing (Supplementary Figure SI). Despite extensive 
screening (see Materials and methods), the remaining four strains 
(mcm-rM2, mcm-rDl — mcm-rD4) could not be isolated, suggest- 
ing that these mutations either inactivate or significantly impair 
the function of the Hfx. volcanii MCM protein in cells grown at 
45°C. Growth of the five viable mutants was indistinguishable 
from the parental wild-type strain H26 (Figure 4A, upper panel). 
These results indicate that the p"7-|38 loop is indeed essential for 
Hfx. volcanii MCM function although it is also clear that some 
point mutations (including replacement of conserved amino acid 
glutamine 186 with alanine) can be tolerated. 

MUTAGENESIS OF THE 09-010 p-HAIRPIN LOOP 

Structural analysis of archaeal MCM proteins identifies four p"- 
hairpins in each monomer, three of which protrude, to a greater 
or lesser extent, into the central channel through which single- 
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FIGURE 3 | Mutagenesis of the p7-p8 p-hairpin loop. (A) Two views of 
the three-dimensional structure of the S. solfatahcus MCM N-terminal 
domain hexamer (PDB entry 2VL6) with the p7-p8 p-hairpin loop colored in 
blue. (B) Alignment of p7-p8 p-hairpin loop region from MCM proteins from 
diverse archaeal species (Hvo, Haloferax volcanii; Mac, Methanosarcina 
acetivorans; Mth, Methanothermobacter thermoautotrophicus; Sso, 
Sulfolobus solfataricus; Csy, Cenarchaeum symbiosum; Kcr, Korarchaeum 
cryptophilum; Tac, Thermoplasma acidophilum; Afu, Archaeoglobus 
fulgidus) and from human. Conserved amino acids are boxed. Detailed 
strain designations and protein accession numbers can be found in Table 1. 
(C) Close-up view of p7-p8 p-hairpin loop in S. solfataricus MCM (PDB 
2VL6). (D) Location and nature of intended mutations in Hfx. volcanii MCM 
protein. Attempts were made to construct six single amino acid 
substitutions (rM1-6) and four deletions (rD1-4). See text for details. 
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FIGURE 4 | Growth of mutant mem strains. Mutant and wild-type (H26) 
strains were grown to mid-log phase in Hv-YPC medium (ODesonm of 
0.2-0.32) before being serially diluted in 18% SW and spotted onto Hv-YPC 
plates (part A) or Hv-YPC plates containing 0, 10, 20, or 30 ng/ml mitomycin 
C (MMC, part B, only H26 and mcm-bH5 are shown). The plates were then 
incubated for 5 days at 45°C. p9-pi0 p-hairpin loop mutant mcm-bH5 is 
significantly more sensitive to MMC than wild-type (H26). 



or double-stranded DNA is thought to pass (Slaymaker and 
Chen, 2012). One of these hairpins, the positively charged |39- 
fHO ^-hairpin (also known as the NT-hairpin) is located in the 
N-terminal domain of the protein (see Figure 5A). Unlike the 
p"7-f58 loop described above, the sequence of this part of the 
MCM protein is not well-conserved across evolution (Figure 5B). 
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FIGURE 5 | Mutagenesis of the |i9-|310 p-hairpin (NT-hairpin). (A) 

Structure of the S. solfataricus MCM N-terminal domain hexamer (PDB 
2VL6) with the p9-pi0 p-hairpins highlighted in blue. (B) Multiple sequence 
alignment of the fJ9-fi1 0 fi-hairpin region from diverse archaeal species and 
from human (see legend to Figure 3 for key and Table 1 for strain details 
and protein accession numbers). Basic amino acids are highlighted in bold 
type. (C) Close-up of S. solfataricus MCM p9-p10 hairpin loop. (D) Location 
and nature of intended mutations in Hfx. volcanii MCM protein. Attempts 
were made to construct six paired alanine substitutions (bH1-6) and four 
short deletions (hD1-4). See text for details. 



However, it appears that the positively charged nature of the hair- 
pin is important for function: mutation of arginine 226 and lysine 
228 in the M. thermoautotrophicum MCM abolishes that protein's 
ability to bind to DNA (Fletcher et al, 2003). 

To probe the in vivo function of the fi9-fi 10 P-hairpin loop 
in Hfx. volcanii, and in the absence of highly conserved amino 
acids presenting themselves as obvious targets for mutagenesis, 
we initially attempted to construct six mutants, man-bHl — 
mcm-bH6, in which adjacent amino acids were replaced with 
paired alanines (Figure 5D). All six mutants were recovered 
following PCR screening (and confirmed by sequencing, see 
Supplementary Figure S2), implying that the f59-|310 P-hairpin 
loop is readily mutable. None of six mutants exhibited obvious 
growth defects at 45° C (Figure 4A, middle panel). 

We therefore extended this analysis by attempting to cre- 
ate strains carrying deletions of increasing size in the p9-fil0 
loop (mutants mcm-hDl — mcm-hD4, see Figure 5D). However, 
despite extensive screening, only two of the four could be iso- 
lated: mcm-hDl and mcm-hD2. Growth of these strains, like 
mcm-bHl — mcm-bH6, was indistinguishable from the parental 
wild-type H26 (Figure 4A, lower panel). That we were unable to 
isolate mcm-rD3 and mcm-rD4 strains strongly implies that these 
deletions either inactivate or significantly impair the function of 
the Hfx. volcanii MCM protein. We conclude from this analysis 
that while the precise sequence of the P9-f510 f5-hairpin loop is 
not absolutely required for the function of the protein, the loop 
itself does have a crucial role. 

MUTAGENESIS OF THE ZINC BINDING DOMAIN 

The N-terminal domain of the archaeal MCM proteins con- 
tains cysteine and histidine residues that fold into a zinc bind- 
ing domain (Figure 6) (Slaymaker and Chen, 2012). A similar 
domain appears also to be present in eukaryotic MCM pro- 
teins but its precise function in either kingdom is unknown. 
We attempted to construct mutants in which each of the four 
cysteines was individually replaced with alanine (Figure 6C). 
However, none of these four mutants that we attempted to 
construct (mutants mcm-C137A, mcm-C140A, mcm-C159A, and 
mcm-C162A, see Figure 6C) could be isolated, implying that all 
four cysteines — and presumably zinc binding — is essential for 
MCM function in vivo. As a control, we tested whether it was 
possible to introduce a silent mutation into the vicinity of the 
cysteine 140 codon by changing the sequence GGGACG encod- 
ing glycine 141 and threonine 142 to an Acc6S\ restriction site, 
GGTACC (mutation mcm-Sl). Sixty colonies were screened by 
Acc65l digestion of the PCR amplified mem gene: 26 contained 
the Acc65I site (data not shown), indicating that the region of the 
gene encoding the zinc binding domain can be mutated in this 
manner. 

SENSITIVITY TO MITOMYCIN C 

In total, we isolated 13 mutant strains carrying either sin- 
gle alanine substitutions, paired alanine substitutions and short 
sequence deletions in two conserved sequence elements, the 
P7-p8 and P9-J510 ^-hairpin loops. None of these strains dis- 
played obvious growth deficiencies: none of the strains grew 
slowly (Figure 4A) nor were any of the strains cold-sensitive or 
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FIGURE 6 | Mutagenesis of the zinc binding domain. (A) Structure of the 
S. solfataricus MCM N-terminal domain hexamer (PDB 2VL6) with the zinc 
binding domains highlighted in blue, alongside a close-up view showing the 
coordination of zinc by the four conserved cysteines. (B) Multiple sequence 
alignment of the zinc binding domain region from diverse archaeal species 
(see legend to Figure 3 for key and Table 1 for strain details and protein 
accession numbers). Conserved cysteines are shown boxed. (C) Location 
and nature of intended mutations in Hfx. volcanii MCM protein. Attempts 
were made to construct four cysteine-to-alanine point mutants. See text for 
details. 



temperature-sensitive (data not shown). We also tested whether 
the strains might be sensitive to DNA damage induced by 
the DNA modifying agent mitomycin C (MMC). MMC forms 
three types of MMC-DNA adducts: monoadducts, intrastrand 
biadducts and interstrand crosslinks (Tomasz, 1995; Bargonetti 
et al, 2010). Of the 13 tested strains, one strain, mcm-bH5, 
showed increased sensitivity to MMC treatment (Figure 4B). This 
mutant has two adjacent residues in the p9-fil0 ^-hairpin loop, 
asparagine 234 and glutamate 235, replaced with alanine. 

DISCUSSION 

The MCM helicase is the key catalytic engine of DNA unwind- 
ing during chromosome replication in eukaryotes and most likely 
in archaea also. Three factors make the archaeal MCM proteins 
excellent models for their human counterparts. First, the rela- 
tively high level of sequence similarity between the eukaryotic and 
archaeal MCM proteins throughout the non-catalytic N-terminal 
and catalytic AAA+ domains. Second, the relative simplicity of 
the homohexameric archaeal MCM complexes compared to the 
heterohexameric eukaryotic complexes. Third, the relative ease 
with which certain archaeal MCM proteins can be purified in 
recombinant form, assayed for various activities and crystallized 



for structure determination. It is striking that although more than 
10 years have passed since publication of the first partial crystal 
structure of an archaeal MCM (Fletcher et al., 2003), no high- 
resolution eukaryotic MCM structures have been solved. Until 
this occurs, the archaea will remain an important model allowing 
detailed dissection of MCM protein function. 

Using multiple protein sequence alignments and crystal struc- 
tures as a guide, a number of groups have identified regions of 
the MCM protein with potentially important roles in MCM func- 
tion. These have then been mutated and the consequences for 
MCM activity determined by a variety of in vitro biochemical 
assays. The p7-f58 P-hairpin loop was first identified in this way, 
for example (Sakakibara et al, 2008; Barry et al., 2009). However, 
despite the growing number of similar studies in the literature, 
no attempt has been made to examine the in vivo consequences of 
such mutations. 

Here we describe the results of probing the in vivo function 
of specific amino acids within an archaeal MCM protein using 
the haloarchaeon Hfx. volcanii as a model. Hfx. volcanii encodes 
a single MCM protein of 702 amino acids. This protein is part of 
the core group of haloarchaeal MCM proteins defined in a previ- 
ous study (MacNeill, 2009) and expanded upon here. Each of 26 
haloarchaeal species investigated encodes a single member of this 
group (Table 1, Figure 1). Ten species encode additional MCM 
family proteins classed as outliers. Given that a single core protein 
is found in all species examined and that 16 of the species encode 
only a single MCM, it is highly likely that these proteins act as 
replicative helicases. In support of this, the single MCM protein 
encoded by Hbt. salinarum NRC-1 (equivalent to the Hbt. sali- 
narum Rl protein listed in Table 1 and labeled as number 4 in 
Figure 1) has previously been shown to be essential for cell sur- 
vival (Berquist et al, 2007). While we have not attempted to delete 
the Hfx. volcanii mem gene in its entirety, our inability to isolate 
certain mutant mem alleles in this study strongly points to the 
Hfx. volcanii mem being essential also. 

The cellular functions of the outlying haloarchaeal MCM pro- 
teins are unknown. With the exception of the group of four 
C-terminally truncated MCM proteins highlighted in Figure 1 
(proteins labeled 5B, 11C, 17B, and 19B), all the outliers pos- 
sess intact Walker A, Walker B and arginine finger motifs, 
suggesting that they may be active as ATPase and/or DNA 
helicases. Interestingly, T. kodakarensis encodes three MCM 
proteins, all three of which have helicase activity, although 
only one is essential for cell viability and which therefore is 
the likely replicative helicase (Ishino et al., 2011; Pan et al., 
2011). 

In order to determine whether Hfx. volcanii presented a 
workable model for reverse genetic analysis of archaeal MCM 
function, we targeted three regions of the proteins for investiga- 
tion. Initially, we focused on the |37-P8 P-hairpin loop located 
at the interface between the N-terminal and catalytic domains 
(Figure 3A) and previously identified as having a role in com- 
municating conformational changes from the DNA binding N- 
terminal domain to the catalytic AAA+ domain (Sakakibara et al., 
2008; Barry et al., 2009). The f37-f58 loop is well conserved across 
eukaryotic and archaeal evolution (Figure 3B). In this study, 
we constructed five strains (mcm-rMl, mcm-rM3 — mcm-rM6) 
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carrying single point mutations in the Hfx. volcanii MCM f$7-P8 
P-hairpin loop (Table 2, Figures 3, 4). The mutated residues 
included glutamine 186, which is conserved in both archaeal and 
eukaryotic MCM proteins (Figure 3B) but which can be replaced 
by alanine without significantly affecting cell growth, as well as 
three charged residues (glutamates 190 and 196, and arginine 
193) and glutamine 199, all of which were also replaced by ala- 
nine without markedly affecting growth rates. In addition, none 
of the five f$7-f58 loop mutants led to increased sensitivity to MMC 
exposure in spotting assays (Figure 4). 

In contrast, we were unable to isolate mutant mcm-rM2 encod- 
ing a protein in which conserved residue glutamate 187 was 
replaced by alanine, nor any of the four deletion alleles mcm- 
rDl — mcm-rD4. Replacement of glutamate 182 in the M. ther- 
moautotrophicum MCM protein, the residue corresponding to 
glutamate 187 in Hfx. volcanii, with arginine led to marked reduc- 
tions in ATPase and helicase activities in vitro, without affecting 
either ATP or DNA binding (Sakakibara et al., 2008), whereas 
mutation of conserved glutamine 181 (equivalent to glutamine 
186 in Hfx. volcanii MCM, the residue mutated in mcm-rMl) to 
alanine had little impact. Clearly, conservation is no predictor of 
in vivo essentiality. 

We turned next to the P7-J58 f5-hairpin loop, also known as 
the NT-hairpin. This loop extends into the central channel of the 
MCM hexamer and may have a role in tracking DNA through the 
channel based on the fact that mutating two positively charged 
residues at the tip of the loop in the M. thermoautotrophicum 
MCM protein abolishes DNA binding in vitro (Fletcher et al., 
2003). In H. volcanii, only a single basic residue is present in 
the P9-P10 loop region, arginine 236. This is likely to lie at or 
near the tip of the f5-hairpin (Figures 5B,C). By comparison with 
M. thermoautotrophicum, it would be reasonable to predict that 
this amino acid would be essential for N-terminal domain DNA 
binding and thus for MCM function in vivo. In the absence 
of more widespread sequence conservation, we chose to probe 
the in vivo function of the P7-J58 ^-hairpin loop initially by 
attempting to construct a series of six mutants in which adjacent 
amino acids were replaced with pairs of alanines (Figure 5D). 
AH six mutant strains (Table 2, Figures 3, 5) were viable, includ- 
ing mcm-bH6 in which arginine 236 was replaced with alanine. 
Next, we attempted to construct four different p9-fi 10 loop dele- 
tion mutants but only two were viable: mcm-hDl and mcm-hD2 
(Table 2, Figures 3, 5). The former removes amino acids 231 and 
232, and the latter amino acids 230-233. In contrast, strains car- 
rying two larger deletions, mcm-hD3 and mcm-hD4, removing 
amino acids 229-234 and 228-235, respectively, could not be 
isolated. Thus, these results clearly demonstrate that while no 
individual amino acid within the P9-fi 10 hairpin loop is essen- 
tial for Hfx. volcanii MCM function in vivo, the loop does have a 
key role to play. 

In addition to mutagenizing the pV-{}8 and fi9-fi 10 loops, 
we also attempted to individually replace with alanine each 
of the four cysteine residues that make up the zinc binding 
domain (Figure 6). However, we were unable to recover any of 
four desired mutants, mcm-C137A, mcm-C140A, mcm-C159A, or 
mcm-C162A, and conclude that the zinc binding domain is there- 
fore likely to have an essential function in vivo. Consistent with 



this, replacing the equivalent of Hfx. volcanii MCM cysteine 162 
with serine produces an M. thermoautotrophicum MCM protein 
with impaired ATPase and single-stranded DNA binding activities 
and no helicase activity (Poplawski et al, 2001). 

Finally, the mutant Hfx. volcanii strains generated in this 
study were tested for increased sensitivity to the DNA modify- 
ing drug mitomycin C (MMC). MMC is a potent DNA inter- 
strand crosslinker and is widely used as replication fork blocking 
agent, as replication cannot continue past such crosslinks. We 
tested all 13 viable mem alleles for MMC sensitivity by spot- 
ting serially diluted cultures onto medium containing increas- 
ing concentrations of MMC and found one strain, mcm-bH5, 
that displayed significantly enhanced sensitivity compared with 
the parental wild-type H26 (Figure 4B). On medium lacking 
MMC, growth of mcm-bH5, as with the other viable fi9-fi 10 loop 
mutants, is indistinguishable from wild-type (Figure 4A). The 
mcm-bH5 protein carries a double substitution in P9-{510 loop 
with asparagine 234 and glutamate 235 being replaced with a 
pair of significantly less bulky alanines. In the absence of a crys- 
tal structure of the Hfx. volcanii MCM protein, it is difficult to 
predict whether the impact of the mcm-bH5 mutations is con- 
fined to the P9-pl0 loop alone or whether these amino acid 
changes will have a wider impact on N-terminal domain struc- 
ture. It is also difficult to envisage how the mcm-bH5 mutations 
would cause cells to become supersensitive to MMC, particu- 
larly as interpreting the effect of MMC on cells is complicated 
by the different modes of action of this compound. MMC forms 
at least four different types of DNA adduct: two species of 
MMC-mono-dG-adduct, intra-strand dG-MMC-dG biadducts 
and inter-strand dG-MMC-dG crosslinks, with the latter being 
assumed to be the replication fork blocking lesion (Tomasz, 
1995; Bargonetti et al, 2010). Given the key role of the MCM 
helicase at the fork it is tempting to speculate that the super- 
sensitivity of mcm-bH5 is a result of difficulties that arise when 
the mutant helicase encounters an inter-strand MMC crosslink. 
However, confirmation of this clearly requires additional experi- 
mentation. 

In conclusion, the results presented here demonstrate the fea- 
sibility of using Hfx. volcanii as a model system for reverse genetic 
analysis of archaeal MCM protein function, provide important 
confirmation of the in vivo importance of conserved structural 
features identified by previous bioinformatic, biochemical and 
structural studies, and offer the prospect of more extensive muta- 
tional analysis, not only of MCM but of other key replication 
factors, in the future. 
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Supplementary Figure SI | Sequence analysis of p7-p8 (l-hairpin mutant 
strains. Codons mutated to encode alanine are boxed. 
Supplementary Figure S2 | Sequence analysis of 89-610 8-hairpin mutant 
strains. Codons mutated to encode alanine pairs (mutants bH1-6) are 
boxed. Vertical lines in hD1 and hD2 indicate location of deleted 
sequences. 
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